On the Reproducibility of the TAGME Entity Linking System

نویسندگان

  • Faegheh Hasibi
  • Krisztian Balog
  • Svein Erik Bratsberg
چکیده

Reproducibility is a fundamental requirement of scientific research. In this paper, we examine the repeatability, reproducibility, and generalizability of TAGME, one of the most popular entity linking systems. By comparing results obtained from its public API with (re)implementations from scratch, we obtain the following findings. The results reported in the TAGME paper cannot be repeated due to the unavailability of data sources. Part of the results are reproducible through the provided API, while the rest are not reproducible. We further show that the TAGME approach is generalizable to the task of entity linking in queries. Finally, we provide insights gained during this process and formulate lessons learned to inform future reducibility efforts.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Effect of Transitive Closure on the Calibration of Logistic Regression for Entity Resolution

This paper describes a series of experiments in using logistic regression machine learning as a method for entity resolution. From these experiments the authors concluded that when a supervised ML algorithm is trained to classify a pair of entity references as linked or not linked pair, the evaluation of the model’s performance should take into account the transitive closure of its pairwise lin...

متن کامل

Estimating the Parameters for Linking Unstandardized References with the Matrix Comparator

This paper discusses recent research on methods for estimating configuration parameters for the Matrix Comparator used for linking unstandardized or heterogeneously standardized references. The matrix comparator computes the aggregate similarity between the tokens (words) in a pair of references. The two most critical parameters for the matrix comparator for obtaining the best linking results a...

متن کامل

Effect of Some Synthetic Parameters on Size and Polydispersity Index of Gelatin Nanoparticles Cross-Linked by CDI/NHS System

In our previous work, the effect of use of a water soluble CDI/NHS system as nontoxic cross-linking agent on fabrication of gelatin nanoparticles was investigated. In this research, the effect of variation in some synthetic parameters of gelatin nanoparticles cross-linked by CDI/NHS system such as type of gelatin and formulation of cross- linking agent on their size and distribution was examine...

متن کامل

Inverse Miniemulsion Method for Synthesis of Gelatin Nanoparticles in Presence of CDI/NHS as a Non-toxic Cross-linking System

In this research, gelatin nanoparticles were synthesized via inverse miniemulsion method by employing a mixture of a water soluble carbodiimide (CDI) and N-hydroxysuccinimide (NHS) as a non-toxic cross-linking system. The gelatin nanoparticles were characterized for their size and size distribution, morphology and stability and were compared with those of nanoparticles cross-linked by glutarald...

متن کامل

TAGME: On-the-fly Annotation of Short Text Fragments

We designed and implemented Tagme, a system that is able to efficiently and judiciously augment a plain-text with pertinent hyperlinks to Wikipedia pages. The specialty of Tagme with respect to known systems [5, 8] is that it may annotate texts which are short and poorly composed, such as snippets of search-engine results, tweets, news, etc.. This annotation is extremely informative, so any tas...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016